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Abstract — We propose distributed algorithms to automatically 
deploy a team of mobile robots to partition and provide coverage 
of a non-convex environment. To handle arbitrary non-convex 
environments, we represent them as graphs. Our partitioning and 
coverage algorithm requires only short-range, unreliable pairwise 
"gossip" communication. The algorithm has two components: (1) 
a motion protocol to ensure that neighboring robots communicate 
at least sporadically, and (2) a pairwise partitioning rule to update 
territory ownership when two robots communicate. By studying 
an appropriate dynamical system on the space of partitions of the 
graph vertices, we prove that territory ownership converges to 
a pairwise-optimal partition in finite time. This new equilibrium 
set represents improved performance over common Lloyd-type 
algorithms. Additionally, we detail how our algorithm scales well 
for large teams in large environments and how the computation 
can run in anytime with limited resources. Finally, we report on 
large-scale simulations in complex environments and hardware 
experiments using the Player/Stage robot control system. 




Fig. 1. Example of a team of robots providing efficient coverage of a non- 
convex environment, as measured by an appropriate multicenter cost function. 
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I. Introduction 

Coordinated networks of mobile robots are already in use 
for environmental monitoring and warehouse logistics. In 
the near future, autonomous robotic teams will revolutionize 
transportation of passengers and goods, search and rescue op- 
erations, and other applications. These tasks share a common 
feature: the robots are asked to provide service over a space. 
One question which arises is: when a group of robots is waiting 
for a task request to come in, how can they best position 
themselves to be ready to respond? 

The distributed environment partitioning problem for robotic 
networks consists of designing individual control and com- 
munication laws such that the team divides a large space into 
regions. Typically, partitioning is done so as to optimize a cost 
function which measures the quality of service provided over 
all of the regions. Coverage control additionally optimizes the 
positioning of robots inside a region as shown in Fig. [T] 

This paper describes a distributed partitioning and coverage 
control algorithm for a network of robots to minimize the 
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expected distance between the closest robot and spatially 
distributed events which will appear at discrete points in a non- 
convex environment. Optimality is defined with reference to a 
relevant "multicenter" cost function. As with all multirobot co- 
ordination applications, the challenge comes from reducing the 
communication requirements: the proposed algorithm requires 
only short-range "gossip" communication, i.e., asynchronous 
and unreliable communication between nearby robots. 

Literature Review 

Territory partitioning and coverage control have applica- 
tions in many fields. In cyber-physical systems, applications 
include automated environmental monitoring (TJ, fetching and 
delivery (2), construction [3|, and other vehicle routing scenar- 
ios JH . More generally, coverage of discrete sets is also closely 
related to the literature on data clustering and /c-means 0, 
as well as the facility location or /c-center problem 0. 
Partitioning of graphs is its own field of research, see (7) for a 
survey. Territory partitioning through local interactions is also 
studied for animal groups, see for example (8). 

A broad discussion of algorithms for partitioning and 
coverage control in robotic networks is presented in l9l 
which builds on the classic work of Lloyd |[T0l on optimal 
quantizer selection through "centering and partitioning." The 
Lloyd approach was first adapted for distributed coverage 
control in [11]. Since this beginning, similar algorithms have 
been applied to non-convex environments liT2l . |[T3lL unknown 
density functions |[T4l . |[T5lL equitable partitioning |[T6l . and 
construction of truss-like objects 0. There are also multi- 
agent partitioning algorithms built on market principles or 
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auctions, see ifTTl for a survey. 

While Lloyd iterative optimization algorithms are popular 
and work well in simulation, they require synchronous and 
reliable communication among neighboring robots. As robots 
with adjacent regions may be arbitrarily far apart, these 
communication requirements are burdensome and unrealistic 
for deployed robotic networks. In response to this issue, in fT8l 
the authors have shown how a group of robotic agents can 
optimize the partition of a convex bounded set using a Lloyd 
algorithm with gossip communication. A Lloyd algorithm with 
gossip communication has also been applied to optimizing 
partitions of non-convex environments in |[T9lL the key idea 
being to transform the coverage problem in Euclidean space 
into a coverage problem on a graph with geodesic distances. 

Distributed Lloyd methods are built around separate parti- 
tioning and centering steps, and they are attractive because 
there are known ways to characterize their equilibrium sets 
(the so-called centroidal Voronoi partitions) and prove con- 
vergence. Unfortunately, even for very simple environments 
(both continuous and discrete) the set of centroidal Voronoi 
partitions may contain several sub-optimal configurations. We 
are thus interested in studying (discrete) gossip coverage 
algorithms for two reasons: (1) they apply to more realistic 
robot network models featuring very limited communication in 
large non-convex environments, and (2) they are more flexible 
than typical Lloyd algorithms meaning they can avoid poor 
suboptimal configurations and improve performance. 

Statement of Contributions 

There are three main contributions in this paper. First, 
we present a discrete partitioning and coverage optimization 
algorithm for mobile robots with unreliable, asynchronous, and 
short-range communication. Our algorithm has two compo- 
nents: a motion protocol which drives the robots to meet their 
neighbors, and a pairwise partitioning rule to update territories 
when two robots meet. The partitioning rule optimizes cover- 
age of a set of points connected by edges to form a graph. The 
flexibility of graphs allows the algorithm to operate in non- 
convex, non-polygonal environments with holes. Our graph 
partition optimization approach can also be applied to non- 
planar problems, existing transportation or logistics networks, 
or more general data sets. 

Second, we provide an analysis of both the convergence 
properties and computational requirements of the algorithm. 
By studying a dynamical system of partitions of the graph's 
vertices, we prove that almost surely the algorithm converges 
to a pairwise-optimal partition in finite time. The set of 
pairwise-optimal partitions is shown to be a proper subset 
of the well- studied set of centroidal Voronoi partitions. We 
further describe how our pairwise partitioning rule can be 
implemented to run in anytime and how the computational 
requirements of the algorithm can scale up for large domains 
and large teams. 

Third, we detail experimental results from our implementa- 
tion of the algorithm in the Player/Stage robot control system. 
We present a simulation of 30 robots providing coverage of a 
portion of a college campus to demonstrate that our algorithm 



can handle large robot teams, and a hardware-in-the-loop 
experiment conducted in our lab which incorporates sensor 
noise and uncertainty in robot position. Through numerical 
analysis we also show how our new approach to partitioning 
represents a significant performance improvement over both 
common Lloyd-type methods and the recent results in lfT8l . 

The present work differs from the gossip Lloyd method f]~8l 
in three respects. First, while iTTSl focuses on territory par- 
titioning in a convex continuous domain, here we operate 
on a graph which allows our approach to consider geodesic 
distances, work in non-convex environments, and maintain 
connected territories. Second, instead of a pairwise Lloyd-like 
update, we use an iterative optimal two-partitioning approach 
which yields better final solutions. Third, we also present a 
motion protocol to produce the sporadic pairwise commu- 
nications required for our gossip algorithm and characterize 
the computational complexity of our proposal. Preliminary 
versions of this paper appeared in fT9l and l20ll . Compared to 
these, the new content here includes: (1) a motion protocol; 
(2) a simplified and improved pairwise partitioning rule; (3) 
proofs of the convergence results; and (4) a description of our 
implementation and a hardware-in-the-loop experiment. 

Paper Structure and Notation 

In Section III we review and adapt coverage and geometric 
concepts (e.g., centroids, Voronoi partitions) to a discrete envi- 
ronment like a graph. We formally describe our robot network 
model and the discrete partitioning problem in Section [Till and 
then state our coverage algorithm and its properties. Section [TV] 
contains proofs of the main convergence results. In Section IVl 
we detail our implementation of the algorithm and present 
experiments and comparative analysis. Some conclusions are 
given in Section [Vll 

In our notation, R>o denotes the set of non-negative real 
numbers and Z>o the set of non-negative integers. Given a 
set A, \A\ denotes the number of elements in A. Given sets 
A, B, their difference is A \ B = {a e A \ a ^ B}. A set- 
valued map, denoted by T : A =4 B, associates to an element 
of A a subset of B. 

II. Preliminaries 

We are given a team of N robots tasked with providing 
coverage of a finite set of points in a non-convex and non- 
polygonal environment. In this Section we translate concepts 
used in coverage of continuous environments to graphs. 

A. Non-convex Environment as a Graph 

Let Q be a finite set of points in a continuous environment. 
These points represent locations of interest, and are assumed 
to be connected by weighted edges. Let G(Q) = (Q,E,w) 
be an (undirected) weighted graph with edge set E C Q x Q 
and weight map w : E —> M>o; we let w e > be the weight 
of edge e. We assume that G(Q) is connected and think of 
the edge weights as distances between locations. 

Remark 2.1 (Discretization of an Environment): For the 
examples in this paper we will use a coarse occupancy grid 



map as a representation of a continuous environment. In 
an occupancy grid [21 1 , each grid cell is either free space 
or an obstacle (occupied). To form a weighted graph, each 
free cell becomes a vertex and free cells are connected with 
edges if they border each other in the grid. Edge weights 
are the distances between the centers of the cells, i.e., the 
grid resolution. There are many other methods to discretize a 
space, including triangularization and other approaches from 
computational geometry l22ll . which could also be used. 

In any weighted graph G(Q) there is a standard notion of 
distance between vertices defined as follows. A path in G is 
an ordered sequence of vertices such that any consecutive pair 
of vertices is an edge of G. The weight of a path is the sum 
of the weights of the edges in the path. Given vertices h and 
k in G, the distance between h and k, denoted dc(h,k), is 
the weight of the lowest weight path between them, or +00 if 
there is no path. If G is connected, then the distance between 
any two vertices in G is finite. By convention, dc(h, k) = 
if h = k. Note that dc(h, k) = dc(k, h), for any h,k G Q. 

B. Partitions of Graphs 

We will be partitioning Q into N connected subsets or 
regions which will each be covered by an individual robot. 
To do so we need to define distances on induced subgraphs of 
G(Q). Given I C Q, the subgraph induced by the restriction 
of G to I, denoted by GDI, is the graph with vertex set 
equal to I and edge set containing all weighted edges of 
G where both vertices belong to I. In other words, we set 
(Q, E,w)Hl= (Qnl,En (Ixl), w\ lxl ). The induced sub- 
graph is a weighted graph with a notion of distance between 
vertices: given h,k G I, we write di(h,k) := dcni{h^k). 
Note that d 7 (ft, k) > d G (h, k). 

We define a connected subset of Q as a subset A c Q such 
that A 7^ and G D A is connected. We can then partition Q 
into connected subsets as follows. 

Definition 2.2 (Connected Partitions): Given the graph 
G(Q) = (Q, E, w), we define a connected N— partition of Q 
as a collection P = {Pi}f =1 of N subsets of Q such that 

© Uti p i = Q> 

(ii) PiHPj = if i + j; 

(iii) Pi ^ for all i G {1, . . . , N}; and 

(iv) Pi is connected for all i G {1, ... , N}. 

Let Part n (Q) to be the set of connected TV— partitions of Q. 

Property (ii) implies that each element of Q belongs to just 
one Pi, i.e., each location in the environment is covered by 
just one robot. Notice that each Pi G P induces a connected 
subgraph in G(Q). In subsequent references to Pi we will 
often mean Gf)Pi, and in fact we refer to P%{t) as the 
dominance subgraph or region of the i-th robot at time t. 

Among the ways of partitioning Q, there are some which 
are worth special attention. Given a vector of distinct points 
c G Q N , the partition P G Part n (Q) is said to be a Voronoi 
partition of Q generated by c if, for each Pi and all k G Pi, 
we have q G Pi and dc(k, q) < dc(k, cf), Vj 7^ i. Note that 
the Voronoi partition generated by c is not unique since how 
to apportion tied vertices is unspecified. 



C. Adjacency of Partitions 

For our gossip algorithms we need to introduce the notion 
of adjacent subgraphs. Two distinct connected subgraphs Pi, 
Pj are said to be adjacent if there are two vertices qi, qj 
belonging, respectively, to Pi and Pj such that (qi,qj) G E. 
Observe that if Pi and Pj are adjacent then Pi U Pj is 
connected. Similarly, we say that robots i and j are adjacent 
or are neighbors if their subgraphs Pi and Pj are adjacent. 
Accordingly, we introduce the following useful notion. 

Definition 2.3 (Adjacency Graph): For P G Partjv(Q), we 
define the adjacency graph between regions of partition P as 
Q{P) = ({1,...,7V},£(P)), where (ij) G £{P) if P< and 
Pj are adjacent. 

Note that G(P) is always connected since G{Q) is. 

D. Cost Functions 

We define three coverage cost functions for graphs: H on e, 
Hmuiticenter, and ^expected- Let me weight function (j) : Q -» R >0 
assign a relative weight to each element of Q. The one-center 
function % nt gives the cost for a robot to cover a connected 
subset A C Q from a vertex h G A with relative prioritization 
set by 

7^oc(h;A) = ^d A (h,fc)0(fe). 

A technical assumption is needed to solve the problem of 
minimizing % ont (- , A): we assume from now on that a total 
order relation, <, is defined on Q, i.e., that Q = {1,...,|Q|}. 
With this assumption we can deterministically pick a vertex 
in A which minimizes H ont as follows. 

Definition 2.4 (Centroid): Let Q be a totally ordered set, 
and let A c Q. We define the set of generalized centroids of 
A as the set of vertices in A which minimize H om , i.e., 

C(A) := argminH ne(^;^4). 

heA 

Further, we define the map Cd as Cd(A) := min{c G C(A)}. 
We call Cd(A) the generalized centroid of A. 

In subsequent use we drop the word "generalized" for 
brevity. Note that with this definition the centroid is well- 
defined, and also that the centroid of a region always belongs 
to the region. With a slight notational abuse, we define 
Cd : Part at (Q) — » Q N as the map which associates to a 
partition the vector of the centroids of its elements. 

We define the multicenter function Hmuiticenter to measure the 
cost for N robots to cover a connected TV-partition P from 
the vertex set c G Q N : 

1 N 

^multicenter (c, P) = ^one^i; Pj)- 

We aim to minimize the performance function H mu iticenter with 
respect to both the vertices c and the partition P. 

We can now state the coverage cost function we will be 
concerned with for the rest of this paper. Let H ex pected • 
Part at (Q) —> M>o be defined by 

^expected (P) ~ ^multicenter (Cd(P) , P) . 



In the motivational scenario we are considering, each robot 
will periodically be asked to perform a task somewhere in its 
region with tasks appearing according to distribution (j). When 
idle, the robots would position themselves at the centroid of 
their region. By partitioning G so as to minimize ^expected* me 
robot team would minimize the expected distance between a 
task and the robot which will service it. 
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(a) (b) (c) 



Fig. 2. All possible centroidal Voronoi partitions of a uniform 2x5 grid. 
Assuming all edge weights are w and all vertices have priority 1, then (a) 
has a cost of 1.2w, (b) has a cost of l.lw, and (c) has a cost of l.Ow. Only 
(c) is pairwise-optimal by definition. 



E. Optimal Partitions 

We introduce two notions of optimal partitions: centroidal 
Voronoi and pairwise-optimal. Our discussion starts with the 
following simple result about the multicenter cost function. 

Proposition 2.5 (Properties of Multicenter Function): Let 
P G Part n (Q) and c G Q N . If P' is a Voronoi partition 
generated by c and d G Q n is such that c- G C(P^) V i, then 

^multicenter (c, P') < ^multicenter (c, P), and 
^multicenter (c , P) 5^ ^multicenter (c, P) . 

The second inequality is strict if any a £ C(P$). 

Proposition 12.51 implies the following necessary condition: 
if (c, P) minimizes ^multicenter, then q G C(P^) Vz and P 
must be a Voronoi partition generated by c. Thus, H ex pected 
has the following property as an immediate consequence of 
Proposition 12.51 given P G Part/v(Q), if P* is a Voronoi 
partition generated by Cd(P) then 

^expected (P ) ^ ^expected 

This fact motivates the following definition. 

Definition 2.6 (Centroidal Voronoi Partition): 
P G Part n (Q) is a centroidal Voronoi partition of Q 
if there exists a c G Q n such that P is a Voronoi partition 
generated by c and C{ G C(P^) V i. 

The set of pairwise-optimal partitions provides an alterna- 
tive definition for the optimality of a partition: a partition is 
pairwise-optimal if, for every pair of adjacent regions, one can 
not find a better two-partition of the union of the two regions. 
This condition is formally stated as follows. 

Definition 2.7 (Pairwise-optimal Partition): 
P G Part at (Q) is a pairwise-optimal partition if for 
every G £(P), 

H ne(Cd(P,); Pi) + n onQ (Cd(Pj)- Pj) = 

L n U n „ 1 min{dp iU p i (a,A:),dp,up i (&,A:)}0(^) L 

The following Proposition states that the set pairwise- 
optimal partitions is in fact a subset of the set of centroidal 
Voronoi partitions. The proof is involved and is deferred to 
Appendix O See Fig. [2] for an example which demonstrates 
that the inclusion is strict. 

Proposition 2.8 (Pairwise-optimal Implies Voronoi): Let 
P G Part at (Q) be a pairwise-optimal partition. Then P is 
also a centroidal Voronoi partition. 

For a given environment Q, a pair made of a centroidal 
Voronoi partition P and the corresponding vector of centroids 
c is locally optimal in the following sense: H ex pected cannot be 



reduced by changing either P or c independently. A pairwise- 
optimal partition achieves this property and adds that for every 
pair of neighboring robots (i,j), there does not exist a two- 
partition of Pi UPj with a lower coverage cost. In other words, 
positioning the robots at the centroids of a centroidal Voronoi 
partition (locally) minimizes the expected distance between a 
task appearing randomly in Q according to relative weights <p 
and the robot who owns the vertex where the task appears. 
Positioning at the centroids of a pairwise-optimal partition 
improves performance by reducing the number of sub-optimal 
solutions which the team might converge to. 

III. Models, Problem Formulation, and Proposed 
Solution 

We aim to partition Q among N robotic agents using 
only asynchronous, unreliable, short-range communication. In 
Section IIII-AI we describe the computation, motion, and com- 
munication capabilities required of the team of robots, and in 
Section HTl-B I we formally state the problem we are addressing. 
In Section IIII-CI we propose our solution, the Discrete Gossip 
Coverage Algorithm, and in llll-Dl we provide an illustration. In 
Sections fill-El and IlII-FI we state the algorithm's convergence 
and complexity properties. 

A. Robot Network Model with Gossip Communication 

Our Discrete Gossip Coverage Algorithm requires a team 
of N robotic agents where each agent i G {1, . . . , N} has the 
following basic computation and motion capabilities: 
(CI) agent i knows its unique identifier z; 
(C2) agent i has a processor with the ability to store G(Q) 

and perform operations on subgraphs of G(Q); and 
(C3) agent i can determine which vertex in Q it occupies and 

can move at speed v along the edges of G(Q) to any 

other vertex in Q. 

Remark 3.1 (Localization): The localization requirement in 
(C3) is actually quite loose. Localization is only used for nav- 
igation and not for updating partitions, thus limited duration 
localization errors are not a problem. 

The robotic agents are assumed to be able to communicate 
with each other according to the range-limited gossip commu- 
nication model which is described as follows: 
(C4) given a communication range r comm > max eG£ ; w e , when 
any two agents reside for some positive duration at a 
distance r < r comm , they communicate at the sample 
times of a Poisson process with intensity A comm > 0. 
Recall that an homogeneous Poisson process is a widely- 
used stochastic model for events which occur randomly and 
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independently in time, where the expected number of events 
in a period A is AA comm . 

Remark 3.2 (Communication Model): (1) This commu- 
nication capability is the minimum necessary for our algo- 
rithm, any additional capability can only reduce the time 
required for convergence. For example, it would be acceptable 
to have intensity A(r) depend upon the pairwise robot distance 
in such a way that A(r) > A com m for r < r com m- 
(2) We use distances in the graph to model limited range 
communication. These graph distances are assumed to ap- 
proximate geodesic distances in the underlying continuous 
environment and thus path distances for a diffracting wave 
or moving robot. 

B. Problem Statement 

Assume that, for all t G M>o, each agent i G {1, . . . , N} 
maintains in memory a connected subset Pi (t) of environment 
Q. Our goal is to design a distributed algorithm that iteratively 
updates the partition P(t) = {Pi{t)}f =1 while solving the 
following optimization problem: 



min Hexpected(^), 
PGPartiv(Q) 



(1) 



subject to the constraints imposed by the robot network model 
with range-limited gossip communication from Section IIII-A1 

C. The Discrete Gossip Coverage Algorithm 

In the design of an algorithm for the minimization prob- 
lem (Q]) there are two main questions which must be addressed. 
First, given the limited communication capabilities in (C4), 
how should the robots move inside Q to guarantee frequent 
enough meetings between pairs of robots? Second, when 
two robots are communicating, what information should they 
exchange and how should they update their regions? 

In this section we introduce the Discrete Gossip Coverage 
Algorithm which, following these two questions, consists of 
two components: 

(1) the Random Destination & Wait Motion Protocol, and 

(2) the Pairwise Partitioning Rule. 

The concurrent implementation of the Random Destination 
& Wait Motion Protocol and the Pairwise Partitioning Rule 
determines the evolution of the positions and dominance 
subgraphs of the agents as we now formally describe. We start 
with the Random Destination & Wait Motion Protocol. 

Random Destination & Wait Motion Protocol 

Each agent i G {1,...,7V} determines its 
motion by repeatedly performing the following 
actions: 

1: agent i samples a destination vertex qi from a uniform 
distribution over its dominance subgraph Pi, 

2: agent i moves to vertex qi through the shortest path in Pi 
connecting the vertex it currently occupies and qu and 

3: agent i waits at qi for a duration r > 0. 

If agent i is moving from one vertex to another we say that 
agent i is in the moving state while if agent i is waiting at 
some vertex we say that it is in the waiting state. 



Remark 3.3 (Motion Protocol): The motion protocol is de- 
signed to ensure frequent enough communication between 
pairs of robots. In general, any motion protocol can be used 
which meets this requirement, so i could select from the 
boundary of Pi or use some heuristic non-uniform distribution 
over Pi. 

If any two agents i and j reside in two vertices at a graphical 
distance smaller that r com m for some positive duration, then at 
the sample times of the corresponding communication Poisson 
process the two agents exchange sufficient information to 
update their respective dominance subgraphs Pi and Pj via 
the Pairwise Partitioning Rule. 

Pairwise Partitioning Rule 

Assume that at time t G M>o, agent i and agent j communi- 
cate. Without loss of generality assume that i < j. Let P%{t) 
and Pj(t) denote the current dominance subgraphs of i and 
j, respectively. Moreover, let t + denote the time instant just 
after t. Then, agents i and j perform the following tasks: 
1: agent i transmits Pi(t) to agent j and vice-versa 
2: initialize W a * := Pi(t), W b * := Pj(t), a* := Cd(P i (t)), 

6* :=Cd(P^)) 
3: compute U := Pi(t) UPj(t) and an ordered list S of all 

pairs of vertices in U 
4: for each (a, b) G S do 
5: compute the sets 

W a := {x G U : du(x,a) < du(x,b)} 
Wb := {x G U : du(x,a) > du(x,b)} 
6: if n om (a;W a ) + H one (b-W b ) < 

Hone (a* ; W a * ) + Hone (6* ; W b * ) then 
7: W a * := W a , W b * := W b , a* := a, 6* := b 

8: Pi(t+) :=W a *, P d (t+) :=W b * 

Some remarks are now in order. 

Remark 3.4 (Partitioning Rule): (1) The Pairwise Parti- 
tioning Rule is designed to find a minimum cost two-partition 
of U. More formally, if list S and sets W a * and W b * for 
(a*, 6*) G S are defined as in the Pairwise Partitioning Rule, 
then W a * and W b * are an optimal two-partition of U. 

(2) While the loop in steps 4-7 must run to completion to 
guarantee that W a * and W b * are an optimal two-partition of 
U, the loop is designed to return an intermediate sub-optimal 
result if need be. If Pi and Pj change, then H ex pected will 
decrease and this is enough to ensure eventual convergence. 

(3) We make a simplifying assumption in the Pairwise 
Partitioning Rule that, once two agents communicate, the 
application of the partitioning rule is instantaneous. We discuss 
the actual computation time required in Section HIT-Fl and some 
implementation details in Section [Vj 

(4) Notice that simply assigning W a * to i and W b * to j can 
cause the robots to "switch sides" in U. While convergence is 
guaranteed regardless, switching may be undesirable in some 
applications. In that case, any smart matching of W a * and W b * 
to i and j may be inserted. 

(5) Agents who are not adjacent may communicate but the 
partitioning rule will not change their regions. Indeed, in this 
case W a * and W b * will not change from Pi(t) and Pj(t). 

Some possible modifications and extensions to the algorithm 
are worth mentioning. 
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Fig. 3. Simulation of four robots dividing a square environment with obstacles. The boundary of each robots territory is drawn in a different color, the 
centroid of a territory is drawn with an X, and pairwise communication is drawn with a solid red line. On the left is the initial partition assigned to the robots. 
The middle frames show two pairwise territory exchanges, with updated territories highlighted with solid colors. The final partition is shown at right. 



Remark 3.5 (Heterogeneous Robotic Networks): In case 
the robots have heterogeneous dynamics, line 5 can be 
modified to consider per-robot travel times between vertices. 
For example, du(x,a) could be replaced by the expected 
time for robot i to travel from a to x while du(x,b) would 
consider robot j. 

Remark 3.6 (Coverage and Task Servicing): Here we fo- 
cus on partitioning territory, but this algorithm can easily be 
combined with methods to provide a service in Q as in (4). 
The agents could split their time between moving to meet their 
neighbors and update territory, and performing requested tasks 
in their region. 

D. Illustrative Simulation 

The simulation in Fig. [3] shows four robots partitioning a 
square environment with obstacles where the free space is 
represented by a 12 x 12 grid. In the initial partition shown 
in the left panel, the robot in the top right controls most of 
the environment while the robot in the bottom left controls 
very little. The robots then move according to the Random 
Destination & Wait Motion Protocol, and communicate ac- 
cording to range-limited gossip communication model with 
t 'comm = 2.5m (four edges in the graph). 

The first pairwise territory exchange is shown in the second 
panel, where the bottom left robot claims some territory from 
the robot on the top left. A later exchange between the two 
robots on the top is shown in the next two panels. Notice 
that the cyan robot in the top right gives away the vertex it 
currently occupies. In such a scenario, we direct the robot to 
follow the shortest path in G(Q) to its updated territory before 
continuing on to a random destination. 

After 9 pairwise territory exchanges, the robots reach the 
pairwise-optimal partition shown at right in Fig. [51 The ex- 
pected distance between a random vertex and the closest robot 
decreases from 2.34m down to 1.74m. 

E. Convergence Property 

The strength of the Discrete Gossip Coverage Algorithm is 
the possibility of enforcing that a partition will converge to a 
pairwise-optimal partition through pairwise territory exchange. 
In Theorem [3?7l we summarize this convergence property, with 
proofs given in Section [iVl 

Theorem 3.7 (Convergence Property): Consider a network 
of N robotic agents endowed with computation and motion ca- 
pacities (CI), (C2), (C3), and communication capacities (C4). 



Assume the agents implement the Discrete Gossip Coverage 
Algorithm consisting of the concurrent implementation of the 
Random Destination & Wait Motion Protocol and the Pairwise 
Partitioning Rule. Then, 

(i) the partition P(t) remains connected and is described by 
P : R> H> PartAr(Q), and 

(ii) P(t) converges almost surely in finite time to a pairwise- 
optimal partition. 

Remark 3.8 (Optimality of Solutions): By definition, a 
pairwise-optimal partition is optimal in that H ex pected can not 
be improved by changing only two regions in the partition. 

Remark 3.9 (Generalizations): For simplicity we assume 
uniform robot speeds, communication processes, and wait- 
ing times. An extension to non-uniform processes would be 
straightforward. 

F. Complexity Properties and Discussion 

In this subsection we explore the computational require- 
ments of the Discrete Gossip Coverage Algorithm, and make 
some comments on implementation. Cost function H ne(^; Pi) 
is the sum of the distances between h and all other vertices 
in Pi. This computation of one-to-all distances is the core 
computation of the algorithm. For most graphs of interest the 
total number of edges \E\ is proportional to \Q\, so we will 
state bounds on this computation in terms of \Pi\. Computing 
one-to-all distances requires one of the following: 

• if all edge weights in G(Q) are the same (e.g., for a graph 
from an occupancy grid), a breadth-first search approach 
can be used which requires 0(\Pi\) in time and memory; 

• otherwise, Dijkstra's algorithm must be used which re- 
quires 0(\Pi \ log (| Pi I)) in time and G(\Pi\) in memory. 

Let 3 (Pi) be the time to compute one-to-all distances in Pi, 
then computing H om (h; Pi) requires 0{D(Pi)) in time. 

Proposition 3.10 (Complexity Properties): The motion 
protocol requires G(\Pi\) in memory, and 0(B)(Pi)) in 
computation time. The partitioning rule requires 0(\Pi | + \Pj \) 
in communication bandwidth between robots i and j, 
0(\Pi \ + \Pj\) in memory, and can run in any time. 

Proof: We first prove the claims for the motion protocol. 
Step 2 is the only non-trivial step and requires finding a 
shortest path in Pi, which is equivalent to computing one-to- 
all distances from the robot's current vertex. Hence, it requires 
0(B(Pi)) in time and 0(Pi) in memory. 
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We now prove the claims for the partitioning rule. In step 
1, robots i and j transmit their subgraphs to each other, 
which requires 0(|P;| + \Pj\) in communication bandwidth. 
For step 3, the robots determine U := PiUPj, which requires 
C(\Pi | + \P 3 ; |) in memory to store. Step 4 is the start of a loop 
which executes 0(\U\ 2 ) times, affecting the time complexity 
of steps 5, 6 and 7. Step 5 requires two computations of 
one-to-all distances in U which each take 0(B (£/)). Step 6 
involves four computations of H nt over different subsets of U, 
however those for W a * and Wb* can be stored from previous 
computation. Since W a and Wb are strict subsets of U, step 
5 takes longer than step 6. Step 7 is trivial, as is step 8. The 
total time complexity of the loop is thus 0(\U\ 2 B ([/")). 

However, the loop in steps 4-7 can be truncated after any 
number of iterations. While it must run to completion to 
guarantee that W a * and Wb* are an optimal two-partition of 
U, the loop is designed to return an intermediate sub-optimal 
result if need be. If Pi and Pj change, then H exp ected will 
decrease. Our convergence result will hold provided that all 
elements of S are eventually checked if Pi and Pj do not 
change. Thus, the partitioning rule can run in any time with 
each iteration requiring 0(D) ([/")). ■ 

All of the computation and communication requirements in 
Proposition 13.101 are independent of the number of robots and 
scale with the size of a robot's partition, meaning the Discrete 
Gossip Coverage Algorithm can easily scale up for large teams 
of robots in large environments. 



IV. Convergence Proofs 

This section is devoted to proving the two statements in 
Theorem 13.71 The proof that the Pairwise Partitioning Rule 
maps a connected A^-partition into a connected -partition is 
straightforward. The proof of convergence is more involved 
and is based on the application of Lemma lAJl in Appendix lAl 
to the Discrete Gossip Coverage Algorithm. Lemma IA.1I 
establishes strong convergence properties for a particular class 
of set valued maps (set-valued maps are briefly reviewed in 
Appendix lAl). 

We start by proving that the Pairwise Partitioning Rule is 
well-posed in the sense that it maintains a connected partition. 

Proof of Theorem \3. 71 statement (0): To prove the state- 
ment we need to show that P(t + ) satisfies points (i) through 
(iv) of Definition 12.21 From the definition of the Pairwise 
Partitioning Rule, we have that Pi (t+ ) UPj (t+ ) = P { (t) UPj (t) 
and Pi(t+) H Pj(t + ) = 0. Moreover, since a* G Pi(t + ) and 
6* G Pj(t+), it follows that ^ and Pj(t+) ^ 0. 

These observations imply the validity of points (i), (ii), and 
(iii) for P(£ + ). Finally, we must show that P;(t + ) and Pj(t+) 
are connected, i.e., P(t + ) also satisfies point (iv). To do 
so we show that, given x G W a *, any shortest path in 
Pi(t) UPj(t) connecting x to a* completely belongs to W a *. 
We proceed by contradiction. Let s Xja * denote a shortest path 
in Pi (t) U Pj (t) connecting x to a* and let us assume that 
there exists m G s XyCb * such that m G Wb*. For m to be 
in W b * means that d P .( t ) UP . (t) (ra, &*) < d P . (t)uP . (£) (ra, a*). 



This implies that 

d PiUPj (x,b*) < dp iUPj (m,b*) + d PiUPj (x,m) 
< d PiUPj (ra, a*) + dp. uPj (x, m) 
= d PiUPj (x,a*). 

This is a contradiction for x G W a *. Similar considerations 
hold for W b * . ■ 
The rest of this section is dedicated to proving convergence. 
Our first step is to show that the evolution determined by 
the Discrete Gossip Coverage Algorithm can be seen as a 
set- valued map. To this end, for any pair of robots (i, j) G 
{1, . . . , N} 2 , i ^ j, we define the map T^- : Partjv(Q) — » 
Part at (Q) by 

T, i (P) = (P 1 ,...,P,,...,P i ,...,P iV ), 

where Pi = W a * and Pj = Wb*. 

If at time t G M>o the pair and no other pair of robots 
perform an iteration of the Pairwise Partitioning Rule, then the 
dynamical system on the space of partitions is described by 

P(t+) = Ta (P(t)) . (2) 

We define the set- valued map T : Part at (Q) =4 Part at (Q) as 

T(P) = {T l3 {P) | G {1, . . . , TV} 2 , i + j}. (3) 

Observe that © can then be rewritten as P(t+) G T(P(t)). 

The next two Propositions state facts whose validity is 
ensured by Lemma IB. II of Appendix [B] which states a key 
property of the Random Destination & Wait Motion Protocol. 

Proposition 4.1 (Persistence of Exchanges): Consider N 
robots implementing the Discrete Gossip Coverage Algorithm. 
Then, there almost surely exists an increasing sequence of 
time instants {tk}kez >0 suc ^ mat = ^7*0^(^0) f° r 

some G £(P(t k )% 

Proof: The proof follows directly from Lemma IB. II 
which implies that the time between two consecutive pairwise 
communications is almost surely finite. ■ 

The existence of time sequence {^} fcGZ>0 allows us to to 
express the evolution generate by the Discrete Gossip Cover- 
age Algorithm as a discrete time process. Let P(k) := P(tk) 
and P(k + 1) := P(^), then 

P(fe + 1) GT(P(/c)) 

where T : Part n (Q) =4 Part n (Q) is defined as in ©. 

Given k G Z>o, let Xk denote the information which 
completely characterizes the state of Discrete Gossip Coverage 
Algorithm just after the k-th iteration of the partitioning rule, 
i.e., at time t£_ v Specifically, 2^ contains the information 
related to the partition P(k), the positions of the robots at 
tfr_ l9 and whether each robot is in the waiting or moving state 
at tfr_ v The following result characterizes the probability that, 
given Xk, the (k + l)-th iteration of the partitioning rule is 
governed by any of the maps T^, (i, j) G £(P(k)). 

Proposition 4.2 (Probability of Communication): Consider 
a team of N robots with capacities (CI), (C2), (C3), and 
(C4) implementing the Discrete Gossip Coverage Algorithm. 
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Then, there exists a real number tt G (0, 1), such that, for any 

k e %>o and e £(P(k)) 

P[P(fe + l) = Ii i (P(fe)) \ X k ] >7f. 

Proof: Assume that at time f one pair of robots commu- 
nicates. Given a pair (i,j) G £(P(t)), we must find a lower 
bound for the probability that is the communicating pair. 
Since all the Poisson communication processes have the same 
intensity, the distribution of the chance of communication 
is uniform over the pairs which are "able to communicate," 
i.e., closer than r comm to each other. Thus, we must only 
show that has a positive probability of being able to 

communicate at time t, which is equivalent to showing that 
is able to communicate for a positive fraction of time 
with positive probability. The proof of Lemma iBJl implies that 
with probability at least a/(l — e _AcommT ) any pair in £(P(t)) 
is able to communicate for a fraction of time not smaller than 
J , where a and A are defined in the proof of Lemma IB. II 
Hence the result follows. ■ 



The property in Proposition 14.21 can also be formulated as 
follows. Let a : Z> -> e {1, . . . , N} 2 , i ^ j} be the 

stochastic process such that a(k) is the communicating pair at 
time k. Then, the sequence of pairs of robots performing the 
partitioning rule at time instants 



can be seen as a 



realization of the process a, which satisfies 

P[a(k + l) = (iJ) \a(k)] >ir 



(4) 



for all (ij) e £(P(k)). 

Next we show that the cost function decreases whenever the 
application of T from © changes the territory partition. This 
fact is a key ingredient to apply Lemma IA.1I 

Lemma 4.3 (Decreasing Cost Function): Let 
P e Part at (Q) and let P+ e T(P). If P+ ^ P, then 

^expected (P~^~) ^ ^expected {P)- 

Proof: Without loss of generality assume that (i,j) is the 
pair executing the Pairwise Partitioning Rule. Then 



^expected (P~^~ ) 



^expected (P) 
= Ho„e(Cd(^ + );/> + ) 

-ft one (Cd(P;);P;) 



" ^one 
^one 



(Cd(P/);P+) 
(Cd^);^). 



According to the definition of the Pairwise Partitioning Rule 
we have that if P^ ^ P u P/ + Pj, then 

H ne(Cd(^ + ); P+) + H ne(Cd(P+); P+) 
<Hone(a*;^ + )+Hone(^;P+) 
< H ne(Cd(P,); Pi) + H one (Cd(Pj); Pj) 

from which the statement follows. ■ 

We now complete the proof of the main result, Theorem 13771 
Proof of Theorem \3.7\ statement Note that the 

algorithm evolves in a finite space of partitions, and by Theo- 
rem \3J\ statement ©, the set Part n (Q) is strongly positively 
invariant. This fact implies that assumption (i) of Lemma IA.1I 
is satisfied. From Lemma 14.31 it follows that assumption (ii) 
is also satisfied, with H ex pected playing the role of the function 
U. Finally, the property in © is equivalent to the property 



of persistent random switches stated in Assumption (iii) of 
Lemma lAJl for the special case h = 1. Hence, we are in the 
position to apply Lemma IA.1I and conclude convergence in 
finite-time to an element of the intersection of the equilibria 
of the maps Tij , which by definition is the set of the pairwise- 
optimal partitions. ■ 

V. Experimental Methods & Results 

To demonstrate the utility and study practical issues of 
the Discrete Gossip Coverage Algorithm, we implemented 
it using the open-source Player/Stage robot control system 
1231 and the Boost Graph Library (BGL) El. All results 
presented here were generated using Player 2.1.1, Stage 2.1.1, 
and BGL 1.34.1. To compute distances in uniform edge weight 
graphs we extended the BGL breadth-first search routine with 
a distance recorder event visitor. 

A. Large-scale Simulation 

To evaluate the performance of our gossip coverage al- 
gorithm with larger teams, we tested 30 simulated robots 
partitioning a map representing a 350m x 225m portion of 
campus at the University of California at Santa Barbara. As 
shown in Fig.H the robots are tasked with providing coverage 
of the open space around some of the buildings on campus, 
a space which includes a couple open quads, some narrower 
passages between buildings, and a few dead-end spurs. For 
this large environment the simulated robots are 2m on a side 
and can move at 3.0 ] f. Each territory cell is 3m x 3m. 

In this simulation we handle communication and partition- 
ing as follows. The communication range is set to 30m (10 
edges in the graph) with A com m = Q.3 22 ^. The robots wait 
at their destination vertices for r = 3.5s. This value for r 
was chosen so that on average one quarter of the robots are 
waiting at any moment. Lower values of r mean the robots 
are moving more of the time and as a result more frequently 
miss connections, while for higher r the robots spend more 
time stationary which also reduces the rate of convergence. 
With the goal of improving communication, we implemented a 
minor modification to the motion protocol: each robot picks its 
random destination from the cells forming the open boundar)Q 
of its territory. In our implementation, the full partitioning 
loop may take 5 seconds for the largest initial territories in 
Fig. lU We chose to stop the loop after a quarter second for 
this simulation to verify the anytime computation claim. 

The 30 robots start clustered in the center of the map 
between Engineering II and Broida Hall, and an initial Voronoi 
partition is generated from these starting positions. This initial 
partition is shown on the left in Fig. [4] with the robots 
positioned at the centroids of their starting regions. The initial 
partition has a cost of 37.1m. The team spends about 27 
minutes moving and communicating according to the Dis- 
crete Gossip Coverage Algorithm before settling on the final 
partition on the right of Fig. H The coverage cost of the 
final equilibrium improved by 54% to 17.1m. Visually, the 

! The open boundary of Pi is the set of vertices in Pi which are adjacent 
to at least one vertex owned by another agent. 
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Fig. 4. Images of starting and final partitions for a simulation with 30 robots 




5 10 15 20 25 30 



Time (minutes) 

Fig. 5. Graph of the cost ^expected over time for the simulation in Fig. [4] 

final partition is also dramatically more uniform than the 
initial condition. This result demonstrates that the algorithm 
is effective for large teams in large non-convex environments. 

Fig. [5] shows the evolution of H ex pected during the simulation. 
The largest cost improvements happen early when the robots 
that own the large territories on the left and right of the map 
communicate with neighbors with much smaller territories. 
These big territory changes then propagate through the net- 
work as the robots meet and are pushed and pulled towards a 
lower cost partition. 

B. Implementation Details 

We conducted an experiment to test the algorithm using 
three physical robots in our lab, augmented by six simulated 
robots in a synthetic environment extending beyond the lab. 
Our lab space is 11.3m on a side and is represented by the 
upper left portion of the territory maps in Fig. [7] The territory 
graph loops around a center island of desks. We extended 
the lab space through three connections into a simulated 
environment around the lab, producing a 15.9m x 15.9m 
environment. The map of the environment was specified with 
a 0.15m bitmap which we overlay ed with a 0.6m resolution 
occupancy grid representing the free territory for the robots to 
cover. The result is a lattice-like graph with all edge weights 
equal to 0.6m. The 0.6m resolution was chosen so that our 
physical robots would fit easily inside a cell. 

Additional details of our implementation are as follows. 

Robot hardware: We use Erratic mobile robots from Videre 
Design, as shown in Fig. [6] The vehicle platform has a roughly 




providing coverage of a portion of campus at UCSB. 




Fig. 6. Erratic mobile robot with URG-04LX laser rangefinder. 



square footprint (40cm x 37cm), with two differential drive 
wheels and a single rear caster. Each robot carries an onboard 
computer with a 1.8Ghz Core 2 Duo processor, 1 GB of 
memory, and 802. llg wireless communication. For navigation 
and localization, each robot is equipped with a Hokuyo URG- 
04LX laser rangefinder. The rangefinder scans 683 points over 
240° at 10 Hz with a range of 5.6 meters. 

Experiment setup: Our mixed physical and virtual robot 
experiments are run from a central computer which is attached 
to a wireless router so it can communicate with the physical 
robots. The central computer creates a simulated world using 
Stage which mirrors and extends the real space in which the 
physical robots operate. The central computer also simulates 
the virtual members of the robot team. These virtual robots are 
modeled off of our hardware: they are differential drive with 
the same geometry as the Erratic platform and use simulated 
Hokuyo URG-04LX rangefinders. 

Localization: We use the amcl driver in Player which 
implements Adaptive Monte-Carlo Localization 1 251 . The 
physical robots are provided with a map of our lab with a 
15cm resolution and told their starting pose within the map. 
We set an initial pose standard deviation of 0.9m in position 
and 12° in orientation, and request localization updates using 
50 of the sensor's range measurements for each change of 
2cm in position or 2° in orientation reported by the robot's 
odometry system. We then use the most likely pose estimate 
output by amcl as the location of the robot. For simplicity and 
reduced computational demand, we allow the virtual robots 
access to perfect localization information. 
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Fig. 7. Each column contains a territory map and the corresponding overhead camera image for a step of the hardware-in-the-loop simulation. The position 
of the camera in the environment is shown with a camera icon in the territory map. The physical robots are numbered 1, 2, and 3 and have the orange, blue, 
and lime green partitions. Their positions in each territory map are indicated with numbered circles. 



Motion Protocol: Each robot continuously executes the 
Random Destination & Wait Motion Protocol, with navigation 
handled by the snd driver in Player which implements Smooth 
Nearness Diagram navigation l26l . For snd we set the robot 
radius parameter to 22cm, obstacle avoidance distance to 
0.7m, and maximum speeds to 0.4 y and 40-. The snd driver 
is a local obstacle avoidance planner, so we feed it a series 
of waypoints every couple meters along paths found in G(Q). 
We consider a robot to have achieved its target location when 
it is within 20cm and it will then wait for r = 3.5s. For the 
physical robots the motion protocol and navigation processes 
run on board, while there are separate threads for each virtual 
robot on the central computer. 

Communication and Partitioning: As the robots move, a 
central process monitors their positions and simulates the 
range-limited gossip communication model between both real 
and virtual robots. We set r cornm = 2.5m and A comm = 
0.3^. These parameters were chosen so that the robots 
would be likely to communicate when separated by at most 
four edges, but would also sometimes not connect despite 
being close. When this process determines two robots should 
communicate, it informs the robots who then perform the 
Pairwise Partitioning Rule. Our pairwise communication im- 
plementation is blocking: if robot i is exchanging territory 
with j, then it informs the match making process that it is 
unavailable until the exchange is complete. 

C. Hardw are -in-the -Loop Simulation 

The results of our experiment with three physical robots 
and six simulated robots are shown in Figs. [7] and The left 
column in Fig. [7] shows the starting positions of the team of 



robots, with the physical robots, labeled 1, 2, and 3, lined up 
in a corner of the lab and the simulated robots arrayed around 
them. The starting positions are used to generate the initial 
Voronoi partition of the environment. The physical robots own 
the orange, blue, and lime green territories in the upper left 
quadrant. We chose this initial configuration to have a high 
coverage cost, while ensuring that the physical robots will 
remain in the lab as the partition evolves. 

In the middle column, robots 1 and 2 have met along their 
shared border and are exchanging territory. In the territory 
map, the solid red line indicates 1 and 2 are communicating 
and their updated territories are drawn with solid orange and 
blue, respectively. The camera view confirms that the two 
robots have met on the near side of the center island of desks. 

The final partition at right in Fig. [7] is reached after 9^ 
minutes. All of the robots are positioned at the centroids of 
their final territories. The three physical robots have gone from 
a cluster in one corner of the lab to a more even spread around 
the space. Fig. [8] shows the evolution of the cost function 
^expected as the experiment progresses, including the costs for 
each robot. As expected, the total cost never increases and the 
disparity of costs for the individual robots shrinks over time 
until settling at a pairwise-optimal partition. 

In this experiment the hardware challenges of sensor noise, 
navigation, and uncertainty in position were efficiently han- 
dled by the amcl and snd drivers. The coverage algorithm 
assumed the role of a higher-level planner, taking in posi- 
tion data from amcl and directing snd. By far the most 
computationally demanding component was amcl, but the 
position hypotheses from amcl are actually unnecessary: our 
coverage algorithm only requires knowledge of the vertex 
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Fig. 8. Evolution of cost functions during the experiment in Fig. [7] The total 
cost "Hexpected is shown above in black, while H ne for each robot is shown 
below in the robot's color. 



a robot occupies. If a less intensive localization method is 
available, the algorithm could run on robots with significantly 
lower compute power. 

D. Comparative analysis 

In this subsection we present a numerical comparison of the 
performance of the Discrete Gossip Coverage Algorithm and 
the following two Lloyd-type algorithms. 

Decentralized Lloyd Algorithm: This method is from ifTTIl 
and O, we describe it here for convenience. At each discrete 
time instant t G Z>o, each robot % performs the following 
tasks: (1) i transmits its position and receives the positions of 
all adjacent robots; (2) i computes its Voronoi region Pi based 
on the information received; and (3) i moves to Cd(P^). 

Gossip Lloyd Algorithm: This method is from fT9l . It is 
a gossip algorithm, and so we have used the same commu- 
nication model and the Random Destination & Wait Motion 
Protocol to create meetings between robots. Say robots i and j 
meet at time t, then the pairwise Lloyd partitioning rule works 
as follows: (1) robot i transmits Pi(t) to j and vice versa; (2) 
both robots determine U = Pi(t) U Pj(t); (3) robot i sets 
Pi(t + ) to be its Voronoi region of U based on Cd(P$(£)) and 
Cd(Pj(t)), and j does the equivalent. 

For both Lloyd algorithms we use the same tie breaking rule 
when creating Voronoi regions as is present in the Pairwise 
Partitioning Rule: ties go to the robot with the lowest index. 

Our first numerical result uses a Monte Carlo probability 
estimation method from (27] to place probabilistic bounds on 
the performance of the two gossip algorithms. Recall that the 
Chernoff bound describes the minimum number of random 
samples K required to reach a certain level of accuracy in a 
probability estimate from independent Bernoulli tests. For an 
accuracy e G (0, 1) and confidence 1 — r\ G (0, 1), the number 
of samples is given by K > -jrlog|. For 77 = 0.01 and 
e = 0.1, at least 116 samples are required. 

Figure [9] shows both the initial territory partition of the 
extended laboratory environment used and also a histogram 
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Fig. 9. Initial partition and histogram of final costs for a Monte Carlo 
test comparing the Discrete Gossip Coverage Algorithm (black bars), Gossip 
Lloyd Algorithm (gray bars), and Decentralized Lloyd Algorithm (red dashed 
line). For the gossip algorithms, 1 16 simulations were performed with different 
sequences of pairwise communications. The Decentralized Lloyd Algorithm 
is deterministic given an initial condition so only one final cost is shown. 



of the final results for the following Monte Carlo test. The 
environment and robot motion models used are described in 
Section IV-B[ Starting from the indicated initial condition, we 
ran 116 simulations of both gossip algorithms. The random- 
ness in the test comes from the sequence of pairwise com- 
munications. These sequences were generated using: (1) the 
Random Destination & Wait Motion Protocol with qi sampled 
uniformly from the open boundary of Pi and r = 3.5s; 
and (2) the range-limited gossip communication model with 

r , — 9 r vrn , in H \ _ n P comm 

' cornrn — £.0111 anu /\comm — u.o g 

The cost of the initial partition in Fig. [9] is 5.48m, while 
the best known partition for this environment has a cost of 
just under 2.18m. The histogram in Fig. [9] shows the final 
equilibrium costs for 116 simulations of the Discrete Gossip 
Coverage Algorithm (black) and the Gossip Lloyd Algorithm 
(gray). It also shows the final cost using the Decentralized 
Lloyd Algorithm (red dashed line), which is deterministic 
from a given initial condition. The histogram bins have a 
width of 0.10m and start from 2.17m. For the Discrete 
Gossip Coverage Algorithm, 105 out of 116 trials reach 
the bin containing the best known partition and the mean 
final cost is 2.23m. The Gossip Lloyd Algorithm reaches 
the lowest bin in only 5 of 116 trials and has a mean final 
cost of 2.51m. The Decentralized Lloyd Algorithm settles 
at 2.48m. Our new gossip algorithm requires an average of 
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Fig. 10. Histograms of final costs from 10 Monte Carlo tests using 
random initial conditions in the environment shown in Fig. [9] comparing 
Discrete Gossip Coverage Algorithm (black bars), Gossip Lloyd Algorithm 
(gray bars), and Decentralized Lloyd Algorithm (red dashed line). For the 
gossip algorithms, 116 simulations were performed with different sequences of 
pairwise communications. The Decentralized Lloyd Algorithm is deterministic 
given an initial condition so only one final cost is shown. The initial cost for 
each test is drawn with the green dashed line. 



96 pairwise communications to reach an equilibrium, whereas 
gossip Lloyd requires 126. 

Based on these results, we can conclude with 99% confi- 
dence that there is at least an 80% probability that 9 robots 
executing the Discrete Gossip Coverage Algorithm starting 
from the initial partition shown in Fig. [9] will reach a pairwise- 
optimal partition which has a cost within 4% of the best known 
cost. We can further conclude with 99% confidence that the 
Gossip Lloyd Algorithm will settle more than 4% above the 
best known cost at least 86% of the time starting from this 
initial condition. 

Figure [TO] compares final cost histograms for 10 different 
initial conditions for the same environment and parameters 
as described above. Each initial condition was created by 



selecting unique starting locations for the robots uniformly 
at random and using these locations to generate an initial 
Voronoi partition. The initial cost for each test is shown with 
the green dashed line. In 9 out of 10 tests the Discrete Gossip 
Coverage Algorithm reaches the histogram bin with the best 
known partition in at least 112 of 116 trials. The two Lloyd 
methods get stuck in sub-optimal centroidal Voronoi partitions 
more than 4% away from the best known partition in more than 
half the trials in 7 of 10 tests. 

VI. Conclusion 

We have presented a novel distributed partitioning and cov- 
erage control algorithm which requires only unreliable short- 
range communication between pairs of robots and works in 
non-convex environments. The classic Lloyd approach to cov- 
erage optimization involves iteration of separate centering and 
Voronoi partitioning steps. For gossip algorithms, however, 
this separation is unnecessary computationally and we have 
shown that improved performance can be achieved without 
it. Our new Discrete Gossip Coverage Algorithm provably 
converges to a subset of the set of centroidal Voronoi par- 
titions which we labeled pairwise-optimal partitions. Through 
numerical comparisons we demonstrated that this new subset 
of solutions avoids many of the local minima in which Lloyd- 
type algorithms can get stuck. 

Our vision is that this partitioning and coverage algorithm 
will form the foundation of a distributed task servicing setup 
for teams of mobile robots. The robots would split their time 
between servicing tasks in their territory and moving to contact 
their neighbors and improve the coverage of the space. Our 
convergence results only require sporadic improvements to 
the cost function, affording flexibility in robot behaviors and 
capacities, and offering the ability to handle heterogeneous 
robotic networks. In the bigger picture, this paper demonstrates 
the potential of gossip communication in distributed coordi- 
nation algorithms. There appear to be many other problems 
where this realistic and minimal communication model could 
be fruitfully applied. 

Appendix A 

For completeness we present a convergence result for set- 
valued algorithms on finite state spaces, which can be recov- 
ered as a direct consequence of [18, Theorem 4.5]. 

Given a set X, a set- valued map T : X =4 X is a map 
which associates to an element x G X a subset Z C X. A 
set- valued map is non-empty if T(x) ^ for all x G X. Given 
a non-empty set- valued map T, an evolution of the dynamical 
system associated to T is a sequence {x n } ne z >0 C X where 
x n+ i G T(x n ) for all n G Z>o- A set W C X is strongly 
positively invariant for T if T(w) C W for all w G W. 

Lemma A.l (Persistent random switches imply convergence): 
Let (X, d) be a finite metric space. Given a collection of maps 
Ti, . . . , T m : X -> X, define the set-valued map T : X =4 X 
by T(x) = {Ti(x), . . . , T m (x)}. Given a stochastic process 
a : Z>o — » {1, . . . , m}, consider an evolution {x n } ne z >0 of 
T satisfying x n+ i = T a r n \(x n ). Assume that: 
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(i) there exists a set W C X that is strongly positively 
invariant for T; 

(ii) there exists a function U : W — » R such that U(w') < 
U{w), for all ^ G W and k/ G T(w) \ {w}; and 

(iii) there exist p G (0, 1) and k G N such that, for all z G 
{1, . . . , m} and n G Z> , there exists ft G {1, . . . , k} 
such that P [a (n + ft) = z | a(n), . . . , <j(l)] > p. 

For z G {1, . . . , m}, let P* be the set of fixed points of T{ in 
W, i.e., Fi = {w G W | Ti(^) = w}. If x G then the 
evolution {x n } ne z >0 converges almost surely in finite time to 
an element of the set (Pi D • • • D P m ), i.e., there exists almost 
surely r G N such that, for some x G (Pi D • • • D F m ), x n = x 
for n > r. 

Appendix B 

This Appendix proves a property of the Random Destina- 
tion & Wait Motion Protocol which is needed to show the 
persistence of pairwise exchanges. 

Lemma B.l: Consider N robots implementing the Dis- 
crete Gossip Coverage Algorithm starting from an arbitrary 
P G PartAr(Q). Consider t G M> and let P(t) denote the 
partition at time t. Assume that at time t no two robots are 
communicating. Then, there exist A > and a G (0, 1), 
independent of P(t) and the positions and states of the 
robots at time t, such that, for every (z,j) G £(P(t)), 
P [(z, j) communicate within (t, t + A)] > a. 

Proof: To begin, we define two useful quantities. Let 
S(Q) := max max max dp.(h.k) be a pseudo- 

PePaxt N (Q)PiePh,kePi 

diameter for Q, and then choose A := 2^^- + 2r. We fix a 
pair (z, j) G £(P), and pick adjacent vertices a G Pi, b G Pj. 

Our goal is to lower bound the probability that z and j 
will communicate within the interval (t, t + A). To do so 
we construct one sequence of events of positive probability 
which enables such communication. Consider the following 
situation: z is in the moving state and needs time U to reach 
its destination q i9 whereas robot j is in the waiting state at 
vertex qj and must wait there for time Tj < r. We denote 
by t(a) (resp. t(b)) the time needed for z (resp. j) to travel 
from qi (resp. qj) to a (resp. b). Let Ei be the event such 
that i performs the following actions in (£, t + A) without 
communicating with any robot k ^ j: 

(i) i reaches qi and waits at qi for the duration r; and 

(ii) i chooses vertex a as its next destination and then stays 
at a for at least A — t(a) — U — r. 

Let Ej be the event such that j performs the following actions 
in (t, t + A) without communicating with any k ^ i: 

(i) j waits at qj for the duration Tj ; and 

(ii) j chooses vertex b as its next destination and then stays 
at b for at least A — t(b) — Tj. 

Let Eij = Ei HEj. 

Next, we lower bound the probability that event Ei occurs. 
Recall the definition of A com m from Sec. IIII-A1 Since a robot 
can have at most N — 1 neighbors, the probability that (i) 
of Ei happens is lower bounded by e~ XcommrN . For (ii), the 
probability that i chooses a is 1/ |P^|, which is lower bounded 
by 1/ \Q\. Then, in order to spend at least (A — t(a) —U — t) 



A-t(a)-tj-T- 



consecutive times. 



at a, i must choose a for [ 
Finally, the probability that during this interval i will not 
communicate with any robot other than j is lower bounded by 
e -A comm A(A/-2) -phe probability that (ii) occurs is thus lower 
bounded by (1/ \Q\) ^ e -A C om m Aiv Combining the bounds for 
(i) and (ii), it follows that 

The same lower bound holds for P[i2j], meaning that 



¥[E i: 



•[^]P[^]> ( 



J-^ 2 ^^- 2A ™(A+r)AT 

\Q\) 6 



If event E^ occurs, then robots i and j will be at adjacent 
vertices for an amount of time during the interval (t, t + A) 
equal to min {A — t(a) — U — r, A — t(b) — Tj} . Since t(a) 
and t(b) are no more than ^p-, we can conclude that i 
and j will be within r com m for at least r. Conditioned on 
E^ occurring, the probability that i and j communicate in 
(t, t + A) is lower bounded by 1 — e _AcommT . A suitable choice 
for a from the statement of the Lemma is thus 



a = (w\) 



1_^ 2 Tt1 „-2A„(A+r)AT 



(1 



It can be shown that this also constitutes a lower bound for the 
other possible combinations of initial states: robot i is waiting 
and robot j is moving; robots i and j are both moving; and 
robots i and j are both waiting. ■ 

Appendix C 

In this appendix we provide the proof of Proposition 12.81 
which states that any pairwise-optimal partition is also a 
centroidal Voronoi partition. 

Proof of Proposition \2.8\ To create a contradiction, 
assume that P G Part n (Q) is a pairwise-optimal partition 
but not a centroidal Voronoi partition. In other words, there 
exist components Pi and Pj in P and an element x of one 
component, say x G Pi, such that 



d G (x,Cd(Pi)) > d G (x,Cd(Pj)). 



Choose Pj such that for all k ^ j 



d G (x,Cd(P k ))>d G (x,Cd(Pj)). 



(5) 



(6) 



Let s^ b be a shortest path in G connecting a to b and let 

m G SxCd(P-) ^ e me ^ rst e l ement °f me P am starting from 
Cd(Pj) which is not in Pj. Let t be such that m G Pg. 

If m = x, then from ([5]) and the definition of SxCd(P-) we 
have that 

dp 2 (x,Cd(P,)) >d G (z,Cd(P<)) 

> d G (s,Cd(Pi)) = dp^p, (^,Cd(P j )) 

which, since x e Pi, creates a contradiction of the fact that P 
is pairwise-optimal. 

Ifmj^x, then, given ©, one of these two conditions holds: 

(i) d G (m,Cd(P*)) > d G (m,Cd(P,)), or 

(ii) d G (m,Cd(P*)) = d G (m,Cd(P,)). 

In the first case, we again have a contradiction using the same 
logic above with m in place of x. In the second case, we 
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must further consider whether there exists a s_ nA(r> n such 

that every vertex in cd(Pg) * s a ^ so m ^* ^ there is not 
such a path, then 

^ (m,Cd(P £ )) > d G (m,Cd(P £ )) = dp.up, (m,Cd(i>-)) 

and we again have a contradiction as above. If there is such a 
path, then we can instead repeat this analysis using using £ in 
place of j and considering the path formed by this Cd ( P ^ 
and the vertices in s^ Cd ( P .) after m. Since the next vertex 
playing the role of m must be closer to x, we will eventually 
find a vertex which creates a contradiction. ■ 
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